Adversarial Grammatical Error Generation: Application to Persian Language

نویسندگان

چکیده

Grammatical error correction (GEC) greatly benefits from large quantities of high-quality training data. However, the preparation a amount labelled data is time-consuming and prone to human errors. These issues have become major obstacles in GEC systems. Recently, performance English systems has drastically been enhanced by application deep neural networks that generate synthetic limited samples. While extensively studied languages such as Chinese, no attempts made for improving Persian Given substantial grammatical semantic differences language, this paper, we propose new learning framework create enough sentences are grammatically incorrect A modified version sequence generative adversarial net with policy gradient developed, which size model scaled down hyperparameters tuned. The generator trained an on dataset 8000 Our proposed achieved bilingual evaluation understudy (BLEU) scores 64.5% BLEU-2, 44.2% BLEU-3, 21.4% BLEU-4, outperformed conventional supervised-trained long short-term memory using maximum likelihood estimation recently labeler machine translation augmentation. This shows promise toward generating

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Grammatical Error Correction of English as Foreign Language Learners

This study aimed to discover the insight of error correction by implementing two correction systems on three Iranian university students. The three students were invited to write four in-class essays throughout the semester, in which their verb errors and individual-selected errors were corrected using the Code Correction System and the Individual Correction System. At the end of the study, the...

متن کامل

Adversarial Generation of Natural Language

Generative Adversarial Networks (GANs) have gathered a lot of attention from the computer vision community, yielding impressive results for image generation. Advances in the adversarial generation of natural language from noise however are not commensurate with the progress made in generating images, and still lag far behind likelihood based methods. In this paper, we take a step towards genera...

متن کامل

Adversarial Ranking for Language Generation

Generative adversarial networks (GANs) have great successes on synthesizing data. However, the existing GANs restrict the discriminator to be a binary classifier, and thus limit their learning capacity for tasks that need to synthesize output with rich structures such as natural language descriptions. In this paper, we propose a novel generative adversarial network, RankGAN, for generating high...

متن کامل

Automated Grammatical Error Correction for Language Learners

A fast growing area in Natural Language Processing is the use of automated tools for identifying and correcting grammatical errors made by language learners. This growth, in part, has been fueled by the needs of a large number of people in the world who are learning and using a second or foreign language. For example, it is estimated that there are currently over one billion people who are non-...

متن کامل

Artificial error generation for translation-based grammatical error correction

Automated grammatical error correction for language learners has attracted a lot of attention in recent years, especially after a number of shared tasks that have encouraged research in the area. Treating the problem as a translation task from 'incorrect' into 'correct' English using statistical machine translation has emerged as a state-of-the-art approach but it requires vast amounts of corre...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International journal on natural language computing

سال: 2022

ISSN: ['2278-1307', '2319-4111']

DOI: https://doi.org/10.5121/ijnlc.2022.11402